METHODS FOR THE PURIFICATION AND AQUEOUS FIBER SPINNING 
OF SPIDER SILKS AND OTHER STRUCTURAL PROTEINS 

STATEMENT OF GOVERNMENTAL INTEREST 

5 The present invention may be used or licensed by the United States Government 

for Governmental purposes without the payment of any royalty. 

FIELD OF THE INVENTION 

The present invention relates to methods for purifying and spinning spider silks 
10 and other structural proteins. Specifically, organic acids are used to lyse recombinant 
cells or other biological samples (such as non-recombinantly derived cells), and 
significantly enrich the purity and yields of structural proteins by hydrolyzing many of 
the macromolecules, while leaving the structural proteins intact. In the case of silk 
proteins, the resulting lysate is further purified by ion-exchange or affinity 
15 chromatography and processed into an aqueous-based mixture for fiber spinning. 

BACKGROUND 

Spiders produce a number of silks for different functions and are therefore useful 
organisms to produce a variety of structural proteins. The structural fibers of the golden 

20 orb-weaver spider (Nephila clavipes), are extremely strong and flexible, and are able to 
absorb impact energy from flying insects without breaking. Dragline silk fibers dissipate 
energy over a broad area and balance stiffiiess, strength and extensibility. In addition, 
silk proteins have very low antigenicity. Therefore, silk fibers are well suited for light 
weight, high performance fiber, composite and medical applications. The composition of 

25 these proteins is mainly glycine, alanine, and other short side chain amino acids, which 
form anti-parallel beta-pleated sheets by hydrogen bonding and hydrophobic interactions; 
Lucas et aL, Discovery 25:19 1964. Many spider silks are resistant to digestion by 
proteolytic enzymes; Tillinghast and Kavanaugh, Journal of Zoology 202:212 1977, and 
insoluble in dilute acids and bases; Mello et aL, American Chemical Society Symposium 

30 Series 544, Silk Polymers: Materials Science and Biotechnology pp 67-79, 1995. Spiders 
are not capable of producing sufficient quantities of proteins to enable a practical use of 
their potential. To solve this problem, recombinant spider silks have been expressed in 
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E.coli; Arcidiacono et ai. Applied Microbiology and Biotechnology 49:31 1998; 
Fahnestock and Irwin, Applied Microbiology and Biotechnology 47:23, 1997; 
Fahnestock and Irwin, Applied Microbiology and Biotechnology 47:33 1997; Lewis et 
al, Protein Expression and Purification 7:400, 1996; Prince et al, Biochemistry 
34: 10879 1995. However, the purification and preparation of a protein for fiber spinning 
has been particularly difficult due to the solubility characteristics and unique properties of 
spider silk and other structural proteins. 

Native Nephila clavipes spider dragline fiber has been partially solubilized in 
hexafluoroisopropanol (HFIP) and dried to a film. A 2.5 % (w/w) solution of the fikn in 
HFIP was used for spinning; Jelinski et al, Macromolecules 31:6733 1998. The spinning 
was conducted with a syringe pump at 6uL/s by forcing the HFIP solution through the 
spinneret into a coagulation bath. 

Affinity chromatography has been used for purification by binding to an 
engineered tag in the recombinant protein while washing away bacterial proteins; 
Arcidiacono et al. Applied Microbiology and Biotechnology 49:31 1998; Fahnestock 
and Irwin, Applied Microbiology and Biotechnology 47:23 1997; Lewis et al, Protein 
Expression and Purification 7:400 1996; Prince et ai. Biochemistry 34:10879 1995. One 
commonly used tag is a hexa-histidine tag, that binds with high affinity to a nickel 
affinity resin. After washing away the bacterial proteins, the tagged recombinant protein 
can be eluted firom the resin. There are several disadvantages to this method: 1) large 
volumes of denaturing buffers are required, involving multiple steps and time; 2) not all 
target protein is recovered; 3) other bacterial proteins remain, often requiring additional 
purification (i.e., high-performance liquid chromatography (HPLC)); 4) the method is not 
easily scaled-up; 5) and the presence of an affinity tag on the recombinant protein may 
increase its antigenicity and interfere with the necessary molecular alignment required for 
high strength fibers. Accordingly, there is a continuing need to develop new methods for 
the purification of structural proteins, spinning of silk fibers lacking the engineered tag 
and enabling the assembly of macromolecular structures without potential interferences. 
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SUMMARY OF THE INVENTION 

As a solution to the above-related deficiencies in the prior art, the present 
invention contemplates using organic acids to purify recombinant spider silks or other 
non-recombinant structural proteins from E. coli bacteria v^hile removing the unw^anted 
5 bacterial proteins. The invention is based on the unique solubilization and stability 
characteristics of these proteins, which are resistant to acid hydrolysis for prolonged 
periods of time at room temperature, while many globular proteins are not. Purified 
protein solutions can be processed into a spinnable aqueous-based mixture for the 
production of fibers. The present invention also contemplates an aqueous protein 
10 spinning method that closely mimics the natural spinning process of the spider and has 
the potential to produce fibers with properties that may resemble or improve upon those 
of natural silk fibers. The present invention represents the first known example of an 
P aqueous process for the spinning of silk proteins into fibers. Furthermore, this invention 

is the only known report, to date, of spinning recombinant silk proteins into fibers. The 
j^f 15 present invention displays numerous advantages over the background art, including a 
rU purification method with organic acids containing fewer steps, requiring less time and 



believed that lower protein recovery rates by the traditional methods are caused, in part, 
from incomplete binding of the protein to the affinity resin. Such traditional techniques 
include, but are not limited to, ion exchange chromatography and affinity 
chromatography. The inability of these proteins to bind to the resin is most likely due to 

25 a high degree of secondary structure even in the presence of high concentrations of 

denaturant. Sample purity from the present invention has been obtained in the range of 
94-97% as determined by amino acid analysis {see Examples 1 and 2, infra). The current 
art results in a wide and inconsistent range of purity ranging from 70% {Prince et aL, 
supra), to 99% {Lewis et al, 1996, supra). While high sample purity is possible using 

30 current art by affinity chromatography, the presence of the histidine affinity tag 

significantly increases the antigenicity of the protein and adversely affects the properties 



smaller volumes of reagents. The present invention also results in better recovery of 
protein at a higher purity. For example, the (SPl)^ protein can be recovered at a level of 

150 mg/L, compared to the 7mg/L recovery rate by the current art {see Prince et a/., 



i: J 20 supra). While not limited to any mechanism by which a recovery is achieved, it is 
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of fibers, films, or other materials by disrupting the proper molecular orientation required 
within the material. Also, in many cases the current art results in samples still 
contaminated by other bacterial proteins, requiring additional purification such as HPLC 
(Prince et al ; Lewis et ai, supra). Finally, the methods of the present invention are 
5 easily scaled-up, and fibers are spun in an environmentally benign solution reducing 

hazardous waste accumulation and cost. For example, the present invention contemplates 
the spinning of silk proteins in an environmentally innocuous aqueous based system. In 
one embodiment of the present invention, a solution of an organic acid is used to effect 
the lysis of bacteria and initiate purification of recombinant silks and native structural 

10 proteins. Globular proteins are hydrolyzed while the silk protein remains intact. Silk 
proteins remain and are concentrated into an aqueous-based mixture for fiber spinning. 
The embodiment may comprise the following steps: a) resuspension of the cell pellet in 
concentrated organic acid and dilution to 2.3N in water (H-/-denaturant and/or surfactant) 
to form a homogeneous mixture; b) incubation at room temperature 1 hour with stirring 

15 and centrifugation to remove cell debris; c) reduction of volume, 10-100 fold by 
ultrafiltration and removal of insoluble material by centrifugation; d) dialysis and 
removal of insoluble material by centrifiigation; e) purification by ion exchange 
chromatography and dialysis into processing buffer; f) concentration of solution to 1 1- 
40% (w/w) protein by ultrafiltration and spinning solution into fibers. While this 

20 embodiment is given for guidance, those of skill in the art may choose to add or delete 
certain steps while remaining within the spirit and scope of the present invention. For 
example, the purification methodology may be employed with or without the spinning of 
the fiber solution. Several native and recombinant structural proteins have been purified 
by this method. Any biological sample containing a structural protein of interest, native 

25 or recombinant, is amenable to the methodology outlined in the invention. Examples of 
biological samples may include, but are not limited to, E. coli cells, other bacterial cells, 
eukaryotic cells, a medium in which a structural protein has been secreted, bone, tissues 
or organs. And while many variables have been examined and optimized throughout the 
process, each variable and optimization exemplify variations of the overall general 

30 method. Choosing among the various parameters is highly dependent on the protein 
being prepared. Table 1 below lends guidance to those of skill in the art. 
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Variables Explored 


Conclusion 


Lysis 




1. Type of acid 


Protein: (4+1)^ Acid: Propionic 

Protein: (Spl)^ Acid: Formic 

Protein: NcDS Acid: Formic 
Protein: OmpF Acid: Valeric 
Protein: Recognin Acid: Valeric 


2. Volume acid/g cells 
(2-100 ml/g) 


Increased acid volume generally decreases 
purity 


3. Acid strength 
(0.5.23N) 


Full strength is best (23N Formic, 13N 
Propionic) for lysis. 


4. Length of lysis (30 min-ovemight) 


1 hr is preferred 


5. Temperature of lysis (250C-370C) 


No effect 


6. Lysis under denaturing conditions 


Solubility improves, purity decreases 


7. Lysis in the presence of detergents 


No effect on purity 


Puriflcation 




1 . Lysate purification by 


Chromatography successfully purifies target 
faffinitv ion exchanee") 


Processing 




1 . Urea concentration in the processing 
buffer H 60 mM v«; 1 IVfi 

UUllVi I 1 \J v 1111 VJ. Vo 1 XVI 1 


IM urea improves solubility slightly 


iVJliiV^ oLldl^lii KJL tilC Ul UW/v/Ooiii^ 

buffer (20-100 mM NaCl) 


Tnrrpfl^inp T^fiCl ronrentration causes 
precipitation 


Spinning 




1 jinnf*nii<i-V>a^pH miYtnTP nrnfpiTi 

1 . OUlll £l\J ti^Wtio Lrdo^U llllAlUl^ L/iV/twlll 

concentration (1 1-35%) 


Protein denendent 


2, Age of aqueous-based mixture (0-5 
days) 


Spinnability changes as the aqueous-based 
mixture ages 


3. Temperature during aging (4-3 O^C) 


Higher temperatures accelerate changes in the 
solution behavior (i.e. spinnability and 
solubility) 


4. Coagulation bath (70-90% 
methanol) 


Methanol percentage affects speed of fiber 
formation, fiber behavior 







A variety of embodiments are contemplated- In one embodiment, the present 
invention contemplates a method, comprising: a) providing: i) a biological sample 
comprising one or more structural polypeptides; and ii) an acid; b) treating said sample 
5 with said acid under conditions such that said one or more polypeptides is recovered in a 
solution. A variety of structural peptides are contemplated, including but not limited to 
polypeptides selected from SEQ ED NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6, SEQ ID 
NO.: 8, SEQ ID N0.:9, and SEQ ID NO.: 11. the peptides may be recombinant or native 
polypeptides. 

10 A variety of acids are contemplated. Organic acids are preferred. In one 

embodiment, the present invention contemplates one or more organic acids selected from 
formic, acetic, propionic, butyric, and valeric acids. 

It is the goal to produce fibers. Therefore, in one em.bodiment, the method further 
comprises the step of manipulating said solution under conditions such that insoluble 

15 fibers are produced. Indeed, the present invention specifically contemplates the fibers 
produced according to the above-described process. 

The present invention specifically contemplates methods wherein recombinant 
structural proteins are manipulated. In one embodiment, the present invention 
contemplates a method, comprising: a) providing: i) host cells expressing one or more 

20 recombinant structural polypeptides, and ii) a solution comprising an organic acid; b) 
treating said host cells with said solution to create a mixture; c) removing insoluble 
material from said mixture; and d) recovering said one or more recombinant polypeptides 
in a solution. Again, a variety of peptides are contemplated. In one embodiment, one or 
more polypeptides are selected from SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6, 

25 SEQ ID NO.: 8, SEQ ID N0.:9, and SEQ ID NO.: 11. Again, a variety of acids are 

contemplated, including but not limited to organic acids selected from formic acid, acetic 
acid, propionic acid, butyric acid, and valeric acid. 

To produce fibers, the method involves manipulation of said recovered one or 
more recombinant polypeptides in said solution under conditions such that insoluble 

30 fibers are produced. The present invention specifically contemplates the fibers 
themselves produced according to the above-described process. 
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A variety of host cells are contemplated for recombinant production. Thus, in one 
embodiment the present invention contemplates a method, comprising: a) providing: I) 
bacterial cells expressing one or more recombinant structural polypeptides, and ii) a 
solution comprising an organic acid selected from formic acid, acetic acid, propionic 
5 acid, butyric acid, and valeric acid; b) treating said bacterial cells with said solution to 
create a mixture; c) removing insoluble material from said mixture; and d) recovering 
said one or more recombinant polypeptides in a solution. As noted above, a variety of 
peptides are contemplated, including but not limited to one or more polypeptides is 
selected from SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6, SEQ ID NO.: 8, and SEQ 
10 ID NO.: 11. 

To produce fibers, said recovered one or more recombinant polypeptides are 
manipulated under conditions such that insoluble fibers are produced. In a preferred 
embodiment, said manipulating comprises: a) concentrating said recovered one or more 
recombinant silk polypeptides to create a concentrated solution; and b) forcing said 
15 concentrated solution through a spinneret. The present invention specifically 

contemplated the fibers themselves v^hich are produced according to this process. 

In sum, the present invention contemplates a method, v^hich comprises providing 
a biological sample composed of a polypeptide and an acid, and manipulating the 
biological sample under conditions such that the polypeptide is substantially purified into 
20 an aqueous-based mixture. 

The method, in several embodiments, includes using polypeptides that may be 
selected from SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6, SEQ ID NO.: 8, SEQ ID 
NO.: 9, and SEQ ID No.: 1 1 herein, although other amino acid sequences are also 
contemplated. 

25 In another embodiment of the present invention, the biological sample comprises 

many types of polypeptides, including, but not limited to, recombinant and non- 
recombinant polypeptides. Structural polypeptides, such as silk polypeptides, are also 
contemplated. 

In further embodiments of the present invention, organic acids are used to 
30 manipulate aqueous-based mixtures under conditions such that the mixtures may be 

processed into fibers. The organic acids that may be used include, but are not limited to, 
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formic, acetic, propionic, butyric, and valeric acids. The present invention further 
contemplates the product that is achieved by the methods that are described herein. 

While a variety of applications for the methods and products herein described are 
contemplated, the applications are not limited. For example, the compositions of the 
present invention may comprise any type of replacement for, or blended with, high 

strength light-weight synthetic polymers (e.g., kevlar®) for applications such as 
manufacture of skis, skateboards, and tennis rackets. The method of the present 
invention can also be used to create a product that can be used as a precursor to the 
construction of many materials, including, but not limited to, films, fibers, woven articles 
{e,g,, clothing), sutures, ballistic protection, parachutes and parachute cords. 



To facilitate an understanding of the invention, a number of Figures are included 
herein. 

Figure 1 presents the nucleic acid sequence of a recombinant silk protein 



Figure 2 presents a recombinant silk protein (SEQ ID NO: 2), designated 
pQE(SPl)^, that is the gene product of the nucleic acid sequence presented in (SEQ ID 

NO: 1). 

Figure 3 presents the nucleic acid sequence of a recombinant silk protein 
(SEQ ID NO: 3) designated pQE[(SPl)4/ (SP2)}]4- 

Figure 4 presents a recombinant silk protein (SEQ ID NO: 4), designated 
pQE[(SPl)4 / (SP2)i]4, that is the gene product of the nucleic acid sequence presented in 
(SEQ ID NO: 3). 

Figure 5 presents the nucleic acid sequence of a recombinant silk protein 
(SEQ ID NO: 5) designated pET[(SPl)4 /(SP2)jj4. 

Figure 6 presents a recombinant silk protein (SEQ ID NO: 6), designated 
pET[(SPl)4/SP2)i]4 that is the gene product of the nucleic acid sequence presented in 
(SEQ ID NO: 5). 

Figure 7 presents the nucleic acid sequence of a recombinant silk protein 
(SEQ ID NO: 7) designated pETNcDS, 




Dl^SCRIPTION OF THE DRAWINGS 



(SEQ ID NO: 1) designated pQE(spl)^. 
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Figure 8 presents a recombinant silk protein (SEQ ID NO: 8), designated 
pETNcDS , that is the gene product of the nucleic acid sequence presented in (SEQ ID 
NO: 7). 

Figure 9 presents a bacterial membrane protein (SEQ ID NO: 9), designated 



Figure 10 presents the nucleic acid sequence of a recombinant structural protein. 
(SEQ ID NO: 10) designated Recognin BL 

Figure 1 1 presents a recombinant structural protein (SEQ ID NO: 11), designated 
Recognin Bl, that is the gene product of the nucleic acid sequence presented in 
10 (SEQ ID NO: 10). 

Figure 12 presents a polyacrylamide gel comparing acid lysis purification of the 
recombinant silk protein pQE(Spl)7 to traditional denaturing method. The pQE(Spl)7 
protein is enriched by acid lysis compared to lysis under denaturing conditions (e.g. 8M 
urea). Subsequent affinity chromatography purification by Ni-NTA of the formic acid 
15 lysate results in a yield comparable to the purification of the traditional denaturing lysate. 

Figure 13 represents a polyacrylamide gel depicting the QAE-Sephadex 
purification scheme with a propionic acid extracted pET[(Spl)4/(Sp2)i]4 protein sample. 

Figure 14 presents a polyacrylamide gel depicting the purificafion of 
pET[(Spl)4/(Sp2)i]4 by lysis with propionic acid with 3M guanidine-HCl and ion- 
20 exchange chromatography using QAE-Sephadex A50. 

Figure 15 represents a polyacrylamide gel depicting the purification of ompF, a 
native E. coli structural protein from a lyophilized E. coli cell pellet. The cell pellet was 
extracted using valeric acid. This extraction procedure yielded a purity of approximately 
85% based on coomassie-blue staining. 
25 Figure 16 presents a polyacrylamide gel of Recognin Bl, a recombinant coiled 

coil structural protein. A cell pellet was lysed in either gel loading buffer, formic acid or 
valeric acid. Relative amounts of the cell pellet loaded onto the gel were 85, 400, 900 ug 
for the loading buffer, formic and valeric acid lysates, respectively. Acetic, propionic or 
butyric acids were unable to extract this protein. 
30 Figure 17 presents photomicrographs of a pETNcDS fiber spun from a protein 

solution of 25% (w/v) as determined by extinction coefficient. Fibers were generated at a 
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rate of at 10 ul/min in a 90% methanol coagulation bath. Consistent diameters of about 60 
urn were observed. Under polarizing light, the color changed uniformly from blue to 
yellow as the angle of light was changed indicating directional orientation in the fiber. 

Figure 18 presents photomicrographs of a fiber spun at a rate of 5 ul/min into a 90 
5 % methanol coagulation bath from a 12.5% aqueous solution of pQE[(Spl)4/(Sp2)i]4 
viewed under a) white light and b) polarized light with a tint plate. The fibers present a 
consistent diameter of about 30 um. 



DEFINITIONS 

10 To facilitate an understanding of the invention, a number of terms are defined. 

The term "aqueous", as defined herein, refers to a water miscible solution. 
The term "aqueous-based mixture", as defined herein, refers to a protein in an 

aqueous solution. The mixture may be used for protein purification, fiber spuming, film 

formation or other materials. 
15 The term "aqueous fiber spinning" refers to a process by which fibers are formed 

from an aqueous solution. 

The terms "spin" "spinnable" as used herein, refers to a mixture that is capable of 

forming a fiber and the fiber remains intact during manipulation (Le. drawing and 

removal from a coagulation bath). 
20 The term "biological sample", as defined herein, refers to any sample containing a 

structural protein of interest, native or recombinant, that is amenable to the methodology 

of the present invention. Examples of biological samples may include, but are not limited 

to, E. coli cells, other bacterial cells, eukaryotic cells, a medium where the structural 

protein has been secreted, bone, tissues or organs. 
25 The term "recombinant protein", as used herein, refers to the product produced by 

expression of a recombinant DNA sequence in a foreign host. The (Spl)7 protein, 

described herein in Example 1, exempHfies a recombinant protein. 

The term "recombinant" or "recombining" refers to a nucleic acid sequence which 

is incorporated into a vector, e.g., into. an aujonomously replicating plasmid or virus, or 
30 into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate 

molecule {e.g., a cDNA or a genomic DNA fragment produced by PGR or restriction 
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endonuclease treatment) independent of other DNA sequences. This definition also 
includes recombinant DNA which is part of a hybrid gene encoding additional amino 
acid sequences. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
5 containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g., 
bacteria). DNA sequences necessary for expression in prokaryotes include a promoter, 
optionally an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals and enhancers. 
10 The term "non-recombinant" refers to proteins that are derived by other than 

recombinant means. Non-recombinant protein may be structural or non-structural. The 
E. coli OmpF membrane protein (described herein in Example 6), which is, in this case, a 
naturally occurring protein that serves as an example of a non-recombinant protein. 



15 sample where the sample is frozen and dried under vacuum (-50^C & 10-100 microns of 

Hg) to produce a powder. 

The term "purified" or a "pure preparation" of a polypeptide, as used herein, 

means a polypeptide that has been separated from other proteins, lipids, and nucleic acids 

with which it naturally occurs. The polypeptide is also separated from substances, e.g., 
20 antibodies or gel matrix, e,g,, polyacrylamide, which are used to purify it. The term 

"substantially purified" polypeptide of the present invention constitutes at least 50%, and 

often above 90%, of the purified preparation as based on amino acid analysis. 

The term "acid" for the purposes of the present invention, refers to any organic 

acid that is capable of hydrolyzing contaminating proteins while allowing silk or other 
25 structural proteins to remain intact. Formic, acetic, propionic, butyric, and valeric acids 

are all examples of organic acids, although other acids are also contemplated. 

For the purposes of this invention, we define a "protein" as a polymer in which 

the monomers are amino acids and which are joined together through amide bonds and 

alternatively referred to as a polypeptide. The terms "protein" and "polypeptide" are 
30 herein used interchangeably. Standard abbreviations for amino acids are used {e.g., P for 

proline). These abbreviations are included in Stryer, Biochemistry, Third Ed., (1988), 



The term "lyophilized pellet" represents a sample that is derived from a biological 
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which is incorporated herein by reference for all purposes. 

The term "silk polypeptide" refers to a protein that approximates the molecular 
and structural profile of native silk proteins and fibers. 

The term "structural protein" or "structural polypeptide" refers to a class of non- 
5 catalytic proteins that may serve as a biological structural support. The proteins may 
serve as biological structural supports by themselves, in conjunction with other proteins, 
or as a matrix or support for other materials. Examples from this class include, but are 
not limited to, proteins such as spider silks ,that are used for spider web architecture; 
porin proteins, which form chaiuiels in biological membranes; keratin, the major 
10 structural component of hair; collagen, the major extracellular protein in connective 
tissue. The silks, OmpF and recognin proteins described herein are examples of 
structural proteins. 

The term "recovered" refers to the process by which protein is locally sequestered 
and captured. 

15 The term "organic acid" refers to the class of acids, such as formic, acetic, 

propionic, butyric, and valeric acids, which are found in living organisms but not 

necessarily, derived from said living organism. Said organic acids can also be obtained 

from commercial vendors (e.g. Sigma Chemical). 

As used herein, the term "host cell" refers to any cell capable of expressing a 

20 functional gene and/or gene product introduced from another cell or organism. This 

definition includes E. coli., as well as other organisms. 

The term "insoluble fibers" refers to proteinaceous fibers that will not solubilize 

in an aqueous solution. 

The term "bacterial" refers to any of numerous groups of microscopic, one-celled 
25 organisms including, but not limited to the phylum Eubacteria of the kingdom 
Procaryotae. 

The term "concentrating" refers to any process that increases the molarity of 
proteinaceous solution. 

The term "concentrated solution" refers to a proteinaceous solution adjusted to a 
30 predetermined molarity higher than said pre-adjusted proteinaceous solution. 

The tenri "spinneref refers to a small orifice used for fiber formation. 
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DESCRIPTION OF THE INVENTION 

A number of different embodiments, as exemplified in the examples, of the 
5 present invention are contemplated, including the scaling-up of the method, automation 
of the method, or use of the method to purify other structural proteins. 

One of skill in the art will recognize that the practice of the present invention will 
employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, 
molecular biology, microbiology, and recombinant DNA, which are within the skill of 
10 the art. Such techniques are described in the literature. See, for example, Molecular 
Cloning, A Laboratory, Manual, 2nd Ed., by Sambrook, Fritsch and Maniatis (Cold 
Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover 
■^3 ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Nucleic Acid Hybridization 

-J 

(B. D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & 
% 15 S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 



1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide 
To Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, 
Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos 
eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 



The proteins of the present invention can be made by direct synthesis (chemically 
or biologically) or by expression fi"om cloned DNA. The source of the protein is not 
limited to recombinant means. Non-recombinant proteins may be purified or spun using 
the methods described herein. Indeed, Example 6, infra, describes the purification of E. 

25 coli OmpF membrane protein, which is, in this case, a naturally occurring (i.e. non- 
recombinant protein) protein. 

The means for expressing cloned DNA are generally known in the art. However, 
there are some considerations for design of expression vectors that are unusual for 
expressing DNA encoding the spider silk proteins of the present invention. For example, 

30 the proteins are highly repetitive in their structure. Accordingly, cloned DNA should be 
propagated and expressed in host cell strains that will maintain repetitive sequences in 
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(Wu et al. eds.). 




extrachromosomal elements {e.g. SURfiTM cells, Stratagene). Also, due to the high 
content of alanine, glycine, proline, and glutamine, it might be advantageous to use a host 
cell which over expresses tRNA for these amino acids. 

The present invention contemplates the use of many different organic acids to 
5 manipulate recombinant and non-recombinant biological samples under conditions such 
that a polypeptide is substantially purified. While the use of E. coli cells with formic, 
propionic and valeric acid are contemplated, the present invention is not limited to these 
particular embodiments, but may also be practiced using other organic acids, such as 
acetic, and butyric, acids, all of which serve as examples. The present invention may also 
10 be practiced using other prokaryotic or eukaryotic cells (aside from, or along with, E. coli 
cells), the media in which the protein-of-interest has been secreted, organs, tissue, bone 
and other components, all of which are examples of biological sample materials. 

EXPERIMENTAL 

15 The following examples serve to illustrate certain embodiments and aspects of the 

present invention and are not to be construed as limiting the scope thereof. 

EXAMPLE 1 
Purification of Recombinant Silk Protein 
20 with Formic Acid and Ion Exchange Chromatography 

In this example, the gene product of pQE(spl)7 (SEQ ID NO: 1), as set out in 
Figure 1, is expressed as recombinant silk protein pQE(SPl)^ (SEQ ID NO: 2), as set out 

in Figure 2, in E.coli as described elsewhere {Prince et al, 1995). The (spl)? gene was 
cloned into the expression vector pQE-9 (Qiagen) and transformed into the host cell line 

25 SG13009pREP4 (Stratagene). Cultures were grown to an Aeoo = 1 .5-2.0 in 4xYT 
medium (per liter: 32 g tryptone, 20 g yeast extract, 5 g NaCl) containing 400ug/mL 
ampicillin. Protein expression was induced by the addition of isopropyl- 
thiogalactopyranoside (IPTG) to a final concentration of ImM. After 1-4 hours the cells 
were harvested by centrifugation and stored for purification. LyophiUzed pellets were 

30 lysed in 23N formic acid (100 ml/g cell pellet), diluted to 4.6N acid with distilled and 
deionized water and stirred 1 hour at room temperature. The cell lysate was clarified by 
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centriftigation and concentrated 20 fold by ultrafiltration. The solution was clarified by 
centrifugation and the supernatant was dialyzed extensively into 8M urea, 10 mM 
NaH^PO^, ImM Tris, 20 mM NaCl, pH 8. Precipitated material was removed by 
centrifugation and the clarified supernatant was applied to an affinity chromatography 
5 resin (nickel-NTA agarose) that had been equilibrated with 8M urea, 10 mM NaH^PO^, 

ImM Tris, 20 mM NaCl, pH 8. The chromatography conditions were designed to bind 
the recombinant silk protein, but let the remaining bacterial proteins pass through the 
column. The column was washed with 8M urea, 10 mM NaHjPO^, ImM Tris, 20 mM 
NaCl, pH 7. The pQE(SPl)7 protein was eluted from the column 8M urea, 10 mM 
10 NaH2P04, ImM Tris, 20 mM NaCl, pH 3. The sample was 94% pure as determined by 
quantitative amino acid analysis. Figure 12 illustrates a comparison of traditional 
purification techniques with the methodology enclosed in this application. Cells lysed 
with formic acid yielded more silk protein with a similar purity when compared to the 
6M guanidine lysis with Ni-NTA affinity chromatography. 

15 

EXAMPLE 2 
Purification of Recombinant Silk Protein with 
Propionic Acid and Ion Exchange Chromatography 

In this example, the gene product of pQE[(SP 1)4 / (SP2) J 4 (SEQ ID NO: 3), as 
20 set out in Figure 3, is expressed as recombinant silk protein pQE[(SPl)^ / (SP2) (SEQ 

ID NO: 4), as set out in Figure 4, in E.coli (Prince et al, 1995). The f(SP])4 / (SP2)jJ^ 
gene was cloned into the expression vector pQE-9 (Qiagen) and transformed into the host 
cell SG13009pREP4 (Stratagene). Cultures were grown to an A600 = 1.5-2.0 in 4xYT 
medium (per liter: 32 g tryptone, 20 g yeast extract, 5 g NaCl) containing 400ug/mL 

25 ampicillin. Protein expression was induced by the addition of IPTG to a final 

concentration of ImM. After 1-4 hours the cells were harvested by centrifugation and 
stored for purification. Lyophilized pellets were lysed in 13. 3N propionic acid (2 ml/g 
cell pellet), diluted to 2.3N acid with distilled and deionized water and stirred 1 hour at 
room temperature. The cell lysate was clarified by centrifiigation and concentrated 20 

30 fold by ultrafiltration. Many of the acid stable proteins became insoluble and were 
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removed by centrifugation. The clarified supernatant was dialyzed extensively into 10 
mM Tris, pH 9 containing 2M urea. The dialyzed solution was applied to a strong anion 
exchange resin, QAE-Sephadex A50, that had been equilibrated with 10 mM Tris, pH 9 
containing 2M urea. The chromatography conditions were designed such that the 
5 positively charged silk protein would not bind to the column, but the remaining proteins 
with lower isoelectric points and net negative charge would bind to the column. The 
column was washed with 10 mM Tris, pH 9 containing 2M urea to recover any remaining 
silk protein. The wash was pooled with the unbound silk containing fraction and 
processed. The sample was 97% pure as determined by quantitative amino acid analysis. 



NO: 6), as set out in Figure 6, in E.coli {Prince et aL, 1995). The [(SP1)4/(SP2) gene 
was cloned into the expression vector pET24 (Novagen Inc.) and transformed into the 
host cell BL2l(DE3) pLysS. Cultures were grown to an A6oo = 19 in defined medium 
(per liter: 13.3 g KH2PO4, 4 g (NH4)2HP04, 1.7 g Citric acid, 25 g glucose, 1.2 g MgS04- 

20 7H2O, 39 mg FeCb, 13 mg MnS04-H20, 10 mg ZnS04-7H20, 3 mg H3BO3, 2.5 mg 

Na2Mo04-2 H2O, 2.5 mg C0CI2-6H2O, 1.8 mg Cu(CH3COO)2- H2O, 6.7 mg EDTA, 4.5 
mg thiamine-HCl) with kanamycin (30 ug/ml) at 3TC, 16 liter/min air and 600 rpm. 
Expression was induced for 1 hr with 1 mM IPTG at which time the cells were harvested 
by centrifugation and stored for purification. LyophiHzed pellets were lysed in 13. 3N 

25 propionic acid (2 ml/g cell pellet), diluted to 2.3N acid with distilled and deionized water 
and stirred 1 hour at room temperature. The cell lysate was clarified by centrifugation 
and concentrated 20 fold by ultrafiltration. Many of the acid stable proteins became 
insoluble and were removed by centrifiigation. The clarified supernatant was dialyzed 
extensively into 10 mM Tris, pH 9 containing 2M urea. The dialyzed solution was 

30 applied to a strong anion exchange resin QAE-Sephadex A50 that had been equilibrated 
with 10 mM Tris, pH 9 containing 2M urea. The chromatography conditions were 
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EXAMPLE 3 



Purification of Recombinant Silk Protein witti 



Propionic Acid and Ion Exchange Chromatography 

In this example, the gene product oipET [(SP1)4 / (SP2)i]4 (SEQ ID NO: 5), as 
15 set out in Figure 5, is expressed as recombinant silk protein pET[(SPl)4/(SP2) (SEQ ID 
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designed such that the positively charged silk protein would not bind to the column, but 
the remaining proteins with lower isoelectric points and net negative charge would bind 
to the column. The column was washed with 10 mM Tris, pH 9 containing 2M urea to 
recover any remaining silk protein. The wash was pooled with the unbound silk 
5 containing fraction and processed. The sample was 75-85% pure as determined by 
coomassie-blue staining of a polyacrylamide gel (see Figure 13). 



In this example, the gene product of pET[(SPl)4 / (SP2) J 4 (SEQ ID NO: 5), as set 
out in Figure 5, is expressed as recombinant silk protein pET[(SPl)4 / (SP2)i]4 (SEQ ID 
NO: 6), as set out in Figure 6, in E.coli (Prince et al., 1995). Lyophilized pellets were 
lysed in 13.3N propionic acid (2 mL/g cell pellet), diluted to 2.3N acid with 6M 

15 guanidine hydrochloride (to a final concentration of 3M) and distilled and deionized 
water and stirred for 1 hour at room temperature. The cell lysate was clarified by 
centrifugation and concentrated 3 fold by ultrafiltration. Precipitated material was 
removed by centriftigation and the clarified supernatant was dialyzed extensively into 10 
mM Tris, pH 9 containing 2M urea. Many of the acid stable proteins became insoluble 

20 and were removed by centriftigation. The dialyzed supernatant was applied to a strong 
anion exchange resin, QAE-Sephadex A50 that had been equilibrated with 10 mM Tris, 
pH 9 containing 2M urea. The chromatography conditions were designed such that the 
positively charged silk protein would not bind to the column, but the remaining proteins 
with lower isoelectric points and net negative charge would bind to the column. The 

25 column was washed with 10 mM Tris, pH 9 containing 2M urea to recover any remaining 
silk protein (see Figure 14). The wash was pooled with the unbound silk containing 
fraction and processed as describe in example 9. This sample was approximately 80% 
pure based on coomassie blue staining. 



EXAMPLE 4 



10 



Puriflcation of Recombinant Silk protein with Propionic Acid 
Containing Denaturant and Ion Exchange Chromatography 
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EXAMPLE 5 



Purification of Recombinant Silk Protein with 



Formic Acid Containing Denaturant and Affinity Chromatography 

In this example, the gene product ofpETNcDS (SEQ ID NO: 7), as set out in 



5 Figure 7, is expressed as recombinant silk protein pETNcDS (SEQ ID NO: 8), as set out 
in Figure 8, in E.coli (Arcidiacono et al 1998). The NcDS gene was cloned into the 
expression vector pET24 (Novagen Inc.) and transformed into the host cell BL21(DE3) 
pLysS. Cultures were grown to an A^oo = 4 in 4xYT medium (per liter: 32 g tryptone, 
20 g yeast extract, 5 g NaCl) with kanamycin (30 ug/ml)at 37°C, 1 liter/min air and 800 

10 rpm. Expression was induced for 3 hr with 1 mM IPTG at which time the cells were 

harvested by centrifiigation and stored for purification. Lyophilized pellets were lysed in 
23N formic acid (5 ml/g cell pellet), diluted to 2.3N acid with 6M guanidine 
hydrochloride (to a final concentration of 3M) and distilled and deionized water and 
stirred 1 hour at room temperature. The cell lysate was clarified by centrifiigation and 

15 concentrated 20 fold by ultrafiltration. The solution was clarified by centrifiigation and 
the supernatant was dialyzed extensively into 8M urea, 10 mM NaH^PO^, ImM Tris, 20 

mM NaCl, pH 8. Precipitated material was removed by centrifiigation and the clarified 
supernatant was applied to an affinity chromatography resin (nickel-NTA agarose) that 
had been equilibrated with 8M urea, 10 mM NaH^PO^, ImM Tris, 20 mM NaCl, pH 8. 

20 The chromatography conditions were designed to bind the recombinant silk protein, but 
let the remaining bacterial proteins pass through the colunrn. The column was washed 
with 8M urea, 10 mM NaH^PO^, ImM Tris, 20 mM NaCl, pH 7. The NcDS protein was 

eluted from the column 8M urea, 10 mM NaH^PO^, ImM Tris, 20 mM NaCl, pH 3. The 

purified protein could then be processed for fiber spinning as in Example 8. 



Valeric Acid Purification of coli OmpF Membrane Protein 

In this example, a native E. coli ompF membrane protein (SEQ ID NO: 9), as 
presented in Figure 9, was purified. Cells were grown and harvested as described in 
30 Example 3. Because OmpF is a native E. coli protein, its production was not induced by 
the addition of IPTG. Lyophilized pellets were lysed in 9.2N valeric acid (2 mL/g of 
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pellet), diluted to 2.3N acid with distilled and deionized water and stirred for 1 hour at 
room temperature. The cell lysate was clarified by centrifugation and applied to an SDS 
polyacrylamide gel for electrophoresis. Figure 15 represents the polyacrylamide gel 
depicting this purification of ompF, a native E, coli structural protein from a lyophilized 
5 E, coli cell pellet. The ompF protein was than blotted onto a nitrocellulose membrane for 
N-terminal sequencing. The resulting 30 amino acids of N-terminal sequence led to the 
identification of E. coli outer membrane protein, ompF. This simple extraction procedure 
yielded a purity of approximately 85% based on coomassie-blue staining. 

10 EXAMPLE? 

Organic Acid Extraction of Recognin Bl Protein 

In this example, the gene product o{ Recognin Bl (SEQ ED NO: 10), as set out in 
Figure 10, was expressed as recombinant synthetic coiled protein Recognin Bl (SEQ ID 
NO: 1 1), as set out in Figure 11, in Exoli (McGrath, K.P. and Kaplan, D.L, Mat. Res. 

15 Symp. Proc. 292, 83-91). The Recognin Bl gene was cloned into the expression vector 
pQE-9 (Qiagen) and transformed into the E. coli host cell, SG13009pREP4 (Qiagen). 
Cultures were grown to an Am of 1 in 4xYt medium (per liter: 32 g tryptone, 20 g yeast 
extract, 5 g NaCl) with ampicillin (400 ug/mL) and kanamycin (50 ug/mL). Expression 
was induced for two hours with 1 mM IPTG at which time the cells were harvested by 

20 centrifugation and stored for purification. Individual lyophilized pellets were lysed 
separately in 23N formic acid, 17.5N acetic acid, 13.4N propionic acid, 10.9N butyric 
acid or 9.2N valeric acid (2 mL/g of pellet), diluted to 2.3N acid with distilled and 
deionized water and stirred for 1 hour at room temperature. The cell lysates were 
clarified by centrifugation and analyzed by SDS-PAGE. Figure 16 presents the 

25 polyacrylamide gel of Recognin Bl, a recombinant coiled coil structural protein. A cell 
pellet was lysed in either gel loading buffer, formic acid or valeric acid. Relative 
amounts of the cell pellet loaded onto the gel were 85, 400, 900 ug for the loading buffer, 
formic and valeric acid lysates, respectively. Acetic, propionic or butyric acids were 
unable to extract this protein. The results indicated that formic and valeric acids were 

30 able to extract a significant quantity of Recognin Bl from E. coli pellets. The extracted 
protein did not appear to be degraded upon exposure to these organic acids. Of the two 



19 



• # 



acids, valeric acid was able to extract Recognin Bl in a relatively pure form. 



EXAMPLE 8 



5 



Processing and Fiber Spinning of Recombinant Silk Protein 

Recombinant pETNcDS protein was purified as in Example 5, concentrated 100- 



fold by ultrafiltration and dialyzed into 10 mM NaH^PO^, ImM Tris, 20 mM NaCl, pH5 

containing IM urea. The dialyzed sample was clarified by centrifugation and 
concentrated by ultrafiltration to a 25% (w/w) solution for fiber spinning. A Harvard 
Apparatus InfiisionAVithdrawal Pump (Harvard Instruments, Natick MA) was used with 
10 a specialized microspinner (cavity volume 0.5 ml), and a 6 cm (0.005 I.D.) piece of 

tubing which was used as a spinneret. The silk solution was forced through the spinneret 
at a rate of 5-10 ul/min into a coagulation bath consisting of 90% methanol. Water 
insoluble fibers, 10-60 um in diameter, were produced and prepared for light microscopy 
(see Figure 17). 



Processing and Fiber Spinning the pET[(Spl)4/(Sp2)i]4 Recombinant Silk Protein 

pET[(SPl)y(SP2)j]^ (SEQ ID NO: 6) was purified as described in Example 4. 

The sample wasclarified by centrifugation and concentrated by ultrafiltration to 9.3% 
20 (w/w) solution for fiber spinning. A Harvard Apparatus Infiision/Withdrawal Pump 
(Harvard Instruments, Natick MA) was used with a specialized microspinner (cavity 
volume 0.5 ml) and a 6 cm (0.005 I.D.) piece of tubing which was used as a spinneret. 
The silk solution was forced through the spinneret at a rate of 2-5 ul/min into a 
coagulation bath consisting of 90% methanol. Fibers were produced from the solution. 
25 Fibers from the 9.3% solution were removed from the coagulation bath: said fibers were 
water insoluble and were subsequently prepared for light microscopy. 
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EXAMPLE 10 

Processing and Fiber Spinning the pQE[(Spl)4/(Sp2)i]4 Recombinant Silk Protein 

The pQE[(Spl)4/(Sp2)i]4 protein was purified by lysis in formic acid/guanidine 
hydrochloride as in Example 5 and dialyzed into 10 mM NaH2P04, 1 mM Tris, 20 mM 
5 NaCl, pH 5 containing 160 mM urea. The dialyzed sample was clarified by 

centrifugation and concentrated by ultrafiltration to 6.5% and 12.5% (w/w) solution for 
fiber spinning. A Harvard Apparatus InfusionAVithdrawal Pump (Harvard Instruments, 
Natick MA) was used with a specialized microspinner (cavity volume 0.5 ml) and a 6 cm 
(0.005" I.D.) piece of tubing was used as a spinneret. The silk solution was forced 
10 through the spinneret at a rate of 5-10 ul/min into a coagulation bath consisting of 90% 
methanol. Fibers were produced from each solution. Only fibers from the 12.5% 
solution could be removed from the coagulation bath; they were water insoluble and 
prepared for light microscopy (see Figure 18). 

15 From the above description and examples, it should be clear that the present 

invention provides improved methods for purifying structural proteins and spinning 
spider silk proteins. Accordingly, this invention is not limited to the particular 
embodiments disclosed, but is intended to cover all modifications that are within the 
spirit and scope of the invention as defined by the appended claims. 
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